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SYSTOLIC  ARRAY  FAULT  TOLERANCE  PERFORMANCE  ANALYSIS 


INTRODUCTION 

Systolic  array  technology  offers  increased  throughput 
using  algorithm-efficient  architectures  implemented  with 
state-of-the-art  very  large  scale  integration  (VLSI) 
technology  (references  1-4).  Several  architectural  issues 
must  be  addressed  before  this  technology  can  transition  from 
the  laboratory  environment  to  systems  in  the  field.  One  of 
these  issues  is  reliability.  Since  the  systolic  array 
approach  is  hardware  intensive,  increased  throughput  is 
accompanied  by  deteriorated  reliability  performance. 
Therefore,  fault  tolerance  design  concepts  must  be 
incorporated  into  systolic  array  architectures  to  restore 
the  lost  reliability  performance. 

The  application  of  fault  tolerance  techniques  involves 
four  basic  areas:  error  detection,  error  correction,  fault 
isolation,  and  system  reconfiguration.  A  detailed  discussion 
of  all  four  aspects  of  fault  tolerance  is  beyond  the  scope 
of  this  report;  therefore,  the  primary  focus  will  be  on 
system  reconfiguration  performance.  The  objective  of  the 
study  reported  is  to  identify  the  reconfiguration  techniques 
which  will  provide  the  most  dependable  performance  for 
systolic  array  architectures.  Once  identified,  these  fault 
tolerant  architectures  can  establish  the  framework  for  error 
detection,  error  correction,  and  fault  isolation.  If 
dependable  performance  cannot  be  achieved  through 
reconfiguration,  any  effort  placed  in  detection,  isolation, 
and  correction  would  be  misdirected. 

Systolic  array  reliability  performance  will  be  compared 
for  several  fault  tolerance  techniques  by  providing 
estimates  of  the  mean  time  between  failure  ( MTBF ) ,  a 
universal  reliability  performance  attribute.  Spatially  and 
temporally  fault  tolerant  techniques  will  be  examined. 
Spatial  fault  tolerant  techniques  use  hardware  reserves  to 
compensate  for  component  failures.  Temporal  fault  tolerance 
techniques  exploit  graceful  degradation  in  algorithm 
execution  time  and/or  system  performance  to  offset  component 
failures  and,  thus,  mitigate  the  initial  hardware  reserve 
level . 

The  MTBF  is  used  as  the  performance  measure  since  it  is 
a  distinctive  requirement  for  military  systems  and  is  used 
to  estimate  availability,  logistics,  and  life  cycle  costs. 
Improvement  in  reliability  is  measured  through  the 
comparison  of  the  MTBF  of  the  fault  tolerant  systolic  array 


1 


and  the  MTBF  of  the  systolic  array  without  a  fault  tolerant 
architecture.  The  acquired  improvement  will  be  compared  to 
the  additional  reserve  hardware  required  for  a  given  fault 
tolerant  approach.  For  example,  the  enhancement  for  25%, 
50%,  75%,  and  100%  redundancy  levels  will  be  assessed  where 
appropriate.  This  comparison  will  help  assess  the  cost 
effectiveness  of  a  fault  tolerant  architecture. 


SYSTOLIC  ARRAY  RELIABILITY 

The  reliability  of  a  circuit  component  is  the 
probability  of  failure-free  performance  for  a  required 
function,  under  stated  conditions,  for  a  given  period  of 
time.  Since  it  is  impossible  to  categorically  state  that  a 
component  will  perform  properly,  the  most  that  can  be  done 
is  to  calculate  or  measure  a  probability  for  successful 
operation.  Reliability  screening  procedures  can  reduce  the 
instantaneous  failure  rate,  or  hazard  rate,  of  a  component 
to  a  constant  value  over  its  useful  lifetime  so  its  overall 
reliability  can  be  expressed  by  the  following  exponential 
function  (references  5  and  6) : 

R  (t )  =  e-^, 

where 

R ( t )  =  the  probability  that  a  component  will  function 
properly  (component  reliability) , 
t  =  time,  and 

Xc  =  component  instantaneous  failure  rate 
(hazard  rate)  . 

The  reliability  performance  of  a  component  is  commonly 
expressed  in  terms  of  its  mean  time  between  failure.  The 
MTBF  of  a  component  can  be  established  by  integrating  the 
reliability  function,  R(t),  over  all  time  so  that 

OO 

MTBF  =  J  R(t)  dt. 

0 

Systolic  arrays  reach  high  computational  rates  because 
algorithms  can  be  projected  onto  interconnected  arrays  of 
processing  elements.  The  maximum  theoretical  throughput  is 
linearly  related  to  the  quantity  of  processing  element:  in 
the  systolic  array.  Theoretical ly,  N  processing  elements, 
combined  in  an  array,  will  provide  N  times  the  throughput  of 
a  single  processing  element.  By  definition,  the  systolic 
array  method  is  hardware  intensive. 
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The  reliability  of  a  systolic  array  designed  without 
fault  tolerant  considerations  depends  on  the  probability  of 
each  and  every  processing  element  remaining  functional. 
This  condition  generates  a  serial  reliability  model.  The 
reliability  of  a  systolic  array  having  identical  processing 
elements  is 

Rsystolic*^  =  e-^PE2t  •••  e“*-PENt 

=  e-N^PEt  ;  for  ^PEi  =  ^pe2  ■ • •  =  ^pen 

=  rn. 

Likewise,  the  MTBF  of  a  systolic  array  with  identical 
processing  elements  is 


:  MTBFSystolic  =  /  e  N^PEt  dt  =  (NXpE)  _1  =  M. 

0 

(These  results  show  that  the  MTBF  of  a  systolic  array 
without  fault  tolerant  architectural  considerations  is 
inversely  proportional  to  its  maximum  theoretical  throughput 
or  N.  An  increase  in  computational  speed  will  be  offset  by 
a  corresponding  decrease  in  reliability,  when  expressed  in 
I  terms  of  MTBF.  Therefore,  reliability  and,  consequently, 

*  fault  tolerance  should  be  a  major  consideration  in  the 

i  design  of  embedded  systolic  arrays.  Ideally,  the  enhancement 

)  provided  by  fault  tolerant  techniques  should  be  proportional 

*  to  N  to  offset  the  degradation  in  MTBF.  Furthermore,  the 

(improvement  needed  in  MTBF  would  actually  be  greater  than  N 
when  less  than  100%  computational  efficiencies  are 
considered  (references  7  and  8) . 

i 

i 

i 
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FAULT  TOLERANT  SYSTOLIC  ARRAYS 

The  object  of  the  systolic  array  fault  tolerance 
approaches  described  in  this  report  is  to  fail-operational 
(reference  9) .  A  fail-operational  mode  will  ensure  that  the 
systolic  array  operates  correctly  in  the  presence  of  faults 
(component  failures).  Thus,  component  failures  will  not 
precipitate  an  unacceptable  reduction  in  system  performance. 
For  example,  an  application  may  require  the  systolic  array 
to  be  capable  of  performing  matrix  operations  even  though  a 
processing  element  (s)  has  failed.  There  are  two  ways  a 
systolic  array  can  fail  operational:  (1)  to  encounter  the 
failure  and  provide  the  maximum  required  performance  and  (2) 
to  encounter  the  failure  with  an  acceptable  level  of 
graceful  degradation  in  performance.  Both  techniques 
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necessitate  a  hardware  reserve  and/or  architectural 
reconfiguration  capability. 

Spatial  redundancy  and  temporal  redundancy  are  two 
generic  approaches  for  fault  tolerance.  Spatial  redundancy 
capitalizes  on  additional  hardware  to  provide  the  same 
throughput  (maximum  array  size)  when  a  failure  occurs. 
Temporal  redundancy  eases  the  required  throughput  when  a 
failure  occurs  and  provides  graceful  degradation  in  system 
performance.  Either  the  same  amount  of  information  is 
processed  in  a  longer  time  period,  or  less  information  is 
processed  in  the  same  time  period. 

Spatially  redundant  methods  yield  optimal  system 
performance,  but  at  the  expense  of  expanded  physical  size 
and  monetary  cost.  The  spatial  redundancy  methods  maintain 
system  performance  using  oversized  arrays.  Six  spatial 
redundancy  methods  will  be  discussed  in  this  report.  Four 
methods  are  generic  and  can  be  used  for  any  systolic  array 
application  and  the  other  two  methods  are  based  on  specific 
architectural  configurations.  The  first  and  most  obvious 
method,  redundant  arrays,  is  the  inclusion  of  one  or  more 
alternate  systolic  arrays  in  the  system.  If  the  primary 
array  fails  an  alternate  array  is  switched  into  the  system. 
The  second  method,  companion  processors,  also  uses  a  fairly 
straightforward  concept  by  providing  redundancy  at  the 
processing  element  level  instead  of  the  array  level.  Each 
processing  element  has  one  companion  which  can  be  interposed 
when  one  element  has  failed.  The  third  and  fourth  methods 
provide  redundancy  by  switching  a  group  of  failed  processing 
elements  with  a  group  of  functioning  processing  elements. 
The  two  methods  identified  in  literature  have  been  referred 
to  as  sequential  row  elimination  (SRE)  and  alternate  row  and 
column  elimination  (ARCE) ,  which  is  contingent  upon  a 
rectangular  systolic  array  configuration.  The  fifth  method 
uses  reserve  hardware  with  dynamic  virtual  channel 
reconfiguration.  The  virtual  channel  method  is  the  most 
sophisticated  and  complex  approach  to  systolic  array  fault 
tolerance.  The  final  method  provides  spatial  redundancy 
specifically  for  tree  architectures. 

Redundant  hardware  fault  tolerance  replicates  common 
hardware  components.  The  spatial  fault  tolerance  approach 
uses  hardware,  i.e.,  circuit  cards,  that  are  identical  in 
design  and  function  as  well  as  construction.  This  require¬ 
ment  is  especially  important  in  systems  where  spare  parts, 
life  cycle  costs  and  logistics  is  of  the  utmost  importance. 

One  systolic  array  application,  adaptive  beamforming, 
is  novel  in  that  an  inherent  temporal  fault  tolerance 
concept  is  built  into  the  algorithm.  The  minimum  variance 
distortionless  response  (MVDR)  algorithm  exemplifies  this 
concept.  The  MVDR  algorithm  allows  for  beamforming 
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subarrays  instead  of  hydrophones.  Therefore,  when  a 
component  failure  occurs  and  reduces  the  available  systolic 
array  size,  all  the  hydrophone  inputs  can  still  be  processed 
by  making  the  subarrays  larger.  Larger  subarrays  will 
reduce  the  covariance  matrices  that  have  to  be  processed  to 
form  the  adaptive  weights.  The  hydrophone  array  will 
provide  the  same  array  gain  but  the  number  of  interference 
sources  that  can  be  cancelled  will  be  smaller.  Processing 
time  and  array  gain  can  be  maintained  at  the  expense  of  the 
reduction  in  degrees  of  freedom.  For  large  sonar  arrays 
this  approach  may  be  beneficial.  Like  spatial  fault 
tolerance,  the  subarray  fault  tolerance  approach  does 
require  the  ability  to  reconfigure  the  array.  In  fact, 
fault  tolerance  through  graceful  degradation,  necessitates 
the  use  of  one  of  the  previously  discussed  spatially 
redundant  techniques.  SRE,  ARCE,  virtual  arrays,  or  tree 
architectures  can  be  used  in  a  graceful  degradation  mode. 
However,  the  application  of  graceful  degradation  would 
reduce  hardware  reserve  requirements  significantly.  The 
subarray  method  is  the  only  temporally  redundant  approach 
that  will  be  discussed  in  this  report  because  of  its 
specific  application  to  adaptive  beamforming.  Proofs  and 
analysis  programs  for  each  approach  are  shown  in  the 
appendices  (A  and  B) . 


SPATIAL  FAULT  TOLERANCE 


REDUNDANT  ARRAYS 


The  redundant  array  method  is  illustrated  in  figure  1. 
This  spatially  redundant  method  includes  the  addition  of  an 
alternate  systolic  array  in  case  the  primary  array  fails. 
Accordingly,  a  minimum  of  100%  hardware  redundancy  is 
required.  If  one  alternate  array  is  used  the  reliability  is 


^Redundant <t>  =  R2N  +  2RN  (1  -  RN)  , 


with  an  associated  MTBF  of 


MTBFRedundant  =  M/2  +  2M  -  2M/2  =  1.5  M. 


The 


redundant  array  method  offers  a  fixed  improvement 
in  MTBF  of  50%  over  the  systolic  array  MTBF,  while  doubling 
the  total  hardware  required.  Ideally,  the  improvement 
should  increase  linearly  with  the  array  size,  N,  thereby 
negating  the  diminished  systolic  array  MTBF.  A  linearly 
related  improvement  would  yield  a  reliability  performance 
that  is  independent  of  array  size  and  would  be  a  design 
objective  for  optimal  performance.  From  a  technical 
standpoint,  the  redundant  array  method  is  neither  cost 
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effective  nor  desirable.  The  linear  reduction  in 
MTBFsystolic  due  to  array  size  has  not  been  alleviated. 


COMPANION  PROCESSORS 

Like  the  redundant  array  method,  the  companion 
processor  fault  tolerance  method  entails  a  hardware 
duplication.  However,  the  companion  processor  method 
inserts  redundancy  at  the  processing  element  level  rather 
than  at  the  array  level,  as  depicted  in  figure  2.  If  any 
single  processing  element  fails  it  is  replaced  with  its 
companion  processor.  This  fault  tolerance  method  offers  the 
advantage  of  an  elementary  switch  implementation.  The 
switching  function  can  be  fulfilled  by  tri-state  buffers  at 
the  output  ports  of  each  processing  element.  This  type  of 
switch  mechanism  would  have  a  low  failure  rate,  since 
minimal  hardware  is  required  to  implement  it .  The 
disadvantage  of  this  method  is  that  a  given  reserve 
processing  element  can  only  be  used  to  replace  its  single 
companion  in  the  array.  Consequently,  a  systolic  array  can 
fail  even  though  reserve  processing  elements  are 
operational . 

The  reliability  for  the  companion  processor  method  is 
^Companion )  =  e"NXCSfc  [R2  +  2  R  (  1  -  R)]N. 

=  RNF  (2  -  R)N, 

where 

^■QS  =  failure  rate  for  the  companion  switch, 

F  =  (1  +  1/sr)  ,  a  measure  of  the  failure  rate  of 

the  switch  compared  to  the  failure  rate  of  the 
processing  element,  and 

sr  =  switch  ratio  (^pE/XCs)  ' 

using  the  binomial  theorem  (reference  10)  to  expand  the 
exponential  term,  the  reliability  can  also  be  expressed  as 


where 


After  integrating  Rcompanion (t ) •  the  MTBF  is  found  to 
be 

N 

MTBFCompanion  -  M  I  /NW*"1*  (-1)1  [N/(i+NF)]- 

i-0 


The  MTBF  for  the  single  systolic  array,  redundant 
array,  and  companion  processor  fault  tolerance  methods  are 
sketched  in  figure  3  in  terms  of  single  processing  element 
MTBF  versus  array  size.  The  companion  processor  data  were 
generated  under  the  assumption  that  the  switch  mechanism  is 
ideal,  i.e.,  the  switch  failure  rate  is  zero,  or  F=l. 

The  companion  processor  method  produces  the  following 
improvement  for  a  systolic  array: 


N 

Companion  =  ^  /N\  2  <N  (-1)1  [N/  (i+NF)  ]  • 

i-0  Vi/ 

The  improvement  versus  array  size  is  shown  for  various 
failure  rate  ratios  in  figure  4.  MTBFcompanion  depends  on 
the  ratio  of  the  processing  element  failure  rate  to  the 
switch  failure  rate.  The  improvement  was  calculated  for 
several  failure  rate  ratios.  The  outcome  shows  that  when 
the  switch  failure  rate  is  comparable  to  the  processing 
element  failure  rate,  F  =  2,  little  or  no  improvement  in 
MTBFcompan ion  is  encountered  compared  to  the  dual  array 
approach.  A  switch  ratio  of  40  or  greater,  F  =  1,  will 
bring  the  improvement  near  the  theoretical  maximum,  which 
ideally  occurs  when  A.qS  =  0.  It  is  beneficial  to  note  that 
the  maximum  theoretical  improvement  approximates  the  square 
root  of  the  number  of  processing  elements.  The  improvement 
is  better  than  the  redundant  array  method  but  still  dees  not 
provide  a  linear  compensation  for  the  reduction  in 
MTEFgy  s «.  0  ]_  j,  c  with  array  size.  The  hardware  is  doubled  for 

nearly  a  (N)^'2  improvement  in  MTBF. 
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SEQUENTIAL  ROW  ELIMINATION  (SRE) 


Reconfiguration  using  reserve  hardware  can  occur  at  the 
processing  element  level  or  at  a  level  that  consists  of 
groups  of  processing  elements.  Group  fault  tolerance 
techniques  include  the  SRE  and  ARCE  methods  presented  by  C. 
S.  Raghavendra  (reference  11)  . 


The  SRE  and  ARCE  methods  achieve  fault  tolerance  by 
successively  removing  rows,  or  rows  and  columns,  in  an  array 
that  has  faulty  processors.  Programmable  switches  and 
interconnections  facilitate  the  removal  of  a  faulty  row  or 
column.  The  SRE  method  will  be  addressed  in  this  section 
as  illustrated  in  figure  5. 


The  SRE  fault  tolerance  substitutes  faulted  rows  of 
processing  elements  with  good  rows  when  one  processing 
element  in  a  row  has  failed.  The  switching  function 
controls  the  exclusion  of  an  entire  row.  For  the  SRE  method 
the  reliability  is 


R  (t )  - 


<-n2/sr)t 


n  (Aj  -  Ai) 
j=0 


where 


=  number  of  elements, 

=  switch  ratio  (XPEAswitch)  > 

=  n  ( n  -  j )  X, 

=  n  ( n  -  i )  X, 

=  Maximum  #  of  rows  -  Minimum  #  of  rows,  and 
=  counting  variable  (0  ...  D) . 


The  MTBF  improvement  was  determined  by  integrating  the 
reliability  function  over  time.  Figure  6  shows  the  effect  of 
array  size  and  switch  failure  rate  on  MTBF  improvement. 
Square  array  architectures  were  postulated.  Notice  the 
switches  used  in  this  case  are  affiliated  with  an  entire  row 
rather  than  one  single  processing  element. 


ALTERNATE  ROW  AND  COLUMN  ELIMINATION  (ARCE) 


The  ARCE  method  switches  out  a  row  or  a  column  when  a 
processing  element  fails.  This  elimination  is  based  on  an 
alternating  removal  concept.  The  hardware  is  shown  in 
figure  7.  The  reliability  for  ARCE  is 


R(t) 


where 


D 

e  I 

k=0 

n  (Aj  -  ai) 

j-0 

j*i 

Aj  =  <n  -  [3/2!)  (n  -  L^/sJ) X 

Ai  =  (n  -  ri/2l)  (n  -  Li/2J)^.,  and 

D  =  Maximum  #  of  rows  &  columns  -  Minimum 
#  of  rows  &  columns . 


k 

I 

i=0 


k-1 

n  Aj 

j-0 


"Ait. 


Once  again,  the  reliability  was  integrated  over  all 
time  to  get  the  improvement  in  MTBF .  Since  the  arrays  were 
assumed  square,  the  hardware  reserve  percentages  varied. 
Table  1  shows  the  results  for  up  to  125%  redundancy  levels. 
Switch  ratios  (sr)  from  1  to  100,000  (an  ideal  switch)  are 
encompassed . 

The  ARCE  method  limits  architectural  configurations  to 
square  or  rectangular  arrays.  The  companion  processor 
method  demonstrates  a  higher  MTBF  and  is  suitable  for  any 
architectural  configuration. 


VIRTUAL  CHANNEL  FAULT  TOLERANCE 

The  virtual  channel  technique  organizes  operational 
processing  elements  in  a  systolic  array.  Connections 
between  switch  ports  form  virtual  channels.  Figure  8 
portrays  the  virtual  channel  method  as  outlined  by  H.  T. 
Kung  and  0.  Menzilcioglu  (reference  12)  .  Unlike  other 
approaches  described  in  this  report,  the  virtual  channel 
method  utilizes  all  available  operational  processing 
elements.  The  redundant  array,  SRE,  and  ARCE  fault 
tolerance  methods  may  actually  drop  out  operational 
processing  elements  along  with  the  faulty  processing 
elements,  thus,  rendering  them  inactive.  The  virtual 
channel  technique  averts  this  situation. 
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•  4 


Ideally  the  virtual  channel  fault  tolerance  method 
provides  for  any  combination  of  operational  processing 
elements  given  any  combination  of  faulty  processing 
elements.  Applying  probability  theory  for  Bernoulli  trials 
(references  13  and  14)  the  reliability  for  the  virtual 
channel  method  with  maximum  coverage  is 

K 

RVirtual  (t)  =  e~  (N+K)^vst  Z  R(N+K-i)  (1_R)i, 

where 

N  =  the  number  of  functionally  required  processing 
elements, 

i  =  the  possible  number  of  failed  processing 
elements,  and 

A.vs  =  virtual  channel  switch  failure  rate. 

The  binomial  theorem  expands  the  equation  to 
K  i 

Rvirtual(t)  -  £  2  /NtKW1\  <-l>P  R ffN+K-i+p)  ^ 

i=0  p-0  \p  /  \p/ 

which  leads  to  an  MTBF  improvement  of 


IVirtual  - 


z  z 

i=0  p=0 


i\(-l)p  [N/(FN+K-i+p)  1  • 


Figures  9  through  12  show  the  improvement  expected  with 
several  switch  ratios  for  25%,  50%,  75%,  and  100%  hardware 
reserve  levels.  The  virtual  switch  failure  rate  is  crucial 
in  evaluating  the  overall  MTBF  for  the  systolic  array.  The 
virtual  array  switch  is  also  more  complex  than  the  switch 
used  in  the  companion  processor,  the  SRE,  or  the  ARCE  fault 
tolerance  methods,  and  is  expected  to  be  less  reliable. 

The  performance  for  the  virtual  channel  method  is 
limited  in  practice  by  the  array  size,  number  of  reserve 
processing  elements,  and  the  virtual  channel  reconfiguration 
combinations .  For  example,  a  systolic  array  with  40  required 
processing  elements  and  20  spares  would  have  to  account  for 
a  maximum  of  7.54  x  lO-'-O  possible  reconfiguration 
c : mb i nations  if  10  processing  elements  had  failed  (for 
complete  fault  coverage) .  For  that  reason,  the  improvement 
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for  larger  arrays  would  be  somewhat  less  than  the 
theoretical  predictions.  But,  for  smaller  systolic  arrays, 
the  performance  is  accurate  because  the  reconfiguration 
complexity  is  minimal.  Fault  coverage  must  be  integrated 
into  the  analysis  to  determine  practically  realizable 
improvement  levels.  Nevertheless,  the  virtual  channel  fault 
tolerance  method  does  have  the  theoretical  capacity  to 
provide  the  best  improvement  for  the  least  amount  of 
hardware  reserve. 

A  closer  observation  of  the  virtual  array  performance 
data  reveals  a  linear  theoretical  improvement.  Table  2 
presents  the  linear  approximations  for  the  maximum 
theoretical  improvement  curves  given  in  figures  9  through 
12.  The  best  improvement  requires  100%  hardware  redundancy 
but  only  diminishes  the  total  systolic  array  MTBF  to  30%  of 
the  MTBF  of  a  single  processing  element.  The  improvement  is 
0.7  (N).  The  virtual  array  fault  tolerance  technique  has  the 
theoretical  potential  to  provide  improvement  in  MTBF  that 
is  independent  of  array  size.  This  feature  was  not  achieved 
by  the  other  techniques  discussed  in  this  report. 


Table  2.  Virtual  Array  Linear  Approximations 


Redundacy 

Slope 

Y-intercept 

Equation 

(I  =) 

25  % 

0.2254 

1.0350 

0.236N  + 

1.10 

50  % 

0.4176 

0.2780 

0.418N  + 

1.05 

75  % 

0.5487 

1.2518 

0.572N  + 

1.03 

100  % 

0.7048 

0.7829 

0  . 70  3N 

+  1.03 

20 
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TREE  ARCHITECTURES 

Fault  tolerant  tree  architectures  can  be  designed  by 
interconnecting  a  redundant  tree  as  depicted  in  figure  13. 
Processing  elements  are  paired  so  an  operational  processing 
element  can  assume  the  computational  burden  of  a  failed 
neighbor  (reference  15)  .  This  idea  is  similar  to  the 
companion  processors.  In  fact,  the  MTBF  for  the  minimum 
throughput  will  be  the  same 

RTree  ^  =  e“N*TSt  [R2  +  2  R  (  1  -  R)  ]N 
=  RNF  (2  -  R)n, 

where 

^Ts  =  the  failure  rate  of  the  switch  used  to  switch 
processing  elements  in  the  tree  architecture. 

However,  the  tree  architecture  doubles  the  available 
throughput  of  the  companion  processor  method  when  failures 
have  not  occurred.  The  companion  processor  architecture 
does  not  allow  for  the  reserve  processing  elements  to  be 
used  until  a  failure  occurs.  The  tree  architecture  can 
provide  for  graceful  degradation.  Although  the  performance 
for  the  companion  processor  fault  tolerance  method  is 
applicable  for  tree  architectures,  care  must  be  exercised  to 
interpret  the  results  correctly.  For  the  companion 
processor  approach  N  is  related  to  both  the  minimum  and 
maximum  throughput  available  from  the  array.  Moreover,  N  is 
the  minimum  throughput  of  the  array,  but  the  maximum 
available  throughput  is  2N  for  tree  architectures. 


TEMPORAL  FAULT  TOLERANCE  THROUGH  GRACEFUL  DEGRADATION 

Graceful  degradation  fault  tolerance  techniques  allow 
reduced  performance  when  a  processing  element  has  failed. 
The  systolic  array  performs  the  identical  algorithm,  but 
with  less  throughput.  Graceful  degradation  can  be 
demonstrated  with  the  MVDR  beamforming  algorithm. 

The  MVDR  algorithm  can  be  implemented  in  a  subarray 
(SA)  mode  (reference  16)  .  In  the  SA  mode,  groups  of 
hydrophones  are  prebeamf ormed  conventionally.  These  SA 
outputs  are  input  to  the  adaptive  beamforming  algorithm  for 
the  entire  array.  The  computational  burden  is  trimmed  down 
from  N^  by  a  factor  of  ,  where  P  is  the  number  of 
hydrophone  elements  in  each  SA.  The  actual  computational 
load  is  [(N/P)^  p  +  (N/P)2  n].  For  example,  an  array  with 


100  elements  divided  into  SA's  consisting  of  2  elements  (50 
groups  of  2)  would  have  a  computational  load  that  is  25%  of 
the  load  for  the  full  array.  Memory  storage  requirements 
for  covariance  matrices  would  also  be  reduced  accordingly. 

The  SA  approach  sacrifices  degrees  of  freedom  for 
execution  time.  The  algorithm  is  executed  on  a  systolic 
array  with  less  than  optimal  throughput  capability  within 
the  same  execution  time.  But,  the  number  of  interfering 
sources  which  can  be  attenuated  by  the  adaptive  beamforming 
algorithm  is  smaller.  This  concession  may  be  acceptable  for 
large  arrays  with  many  hydrophone  sensors. 

A  fault  tolerance  method  which  exploited  the  SA 
approach  would  be  capable  of  reconfiguring  not  only  the 
hardware,  but  also  the  software  to  allow  for  the  addition  of 
conventional  beamforming  of  the  sensors  in  the  SA's. 
Algorithm  fault  tolerance  for  linear  system  solvers  has  been 
addressed  by  Luk  and  Park  (reference  17)  .  The  reliability 
performance  of  this  graceful  degradation  fault  tolerance 
technique  would  depend  on  system  requirements  including 
initial  size  of  the  transducer  array,  minimum  reconfigured 
throughput  of  the  systolic  array,  maximum  throughput  of  the 
systolic  array  and  maximum  number  of  interference  sources 
that  are  required  to  be  attenuated  by  the  adaptive 
beamformer.  These  system  requirements  would  be  integrated 
into  one  of  the  spatially  redundant  design  approaches 
described  earlier.  The  cost  of  implementing  a  given 
spatially  redundant  fault  tolerance  approach  is  mitigated 
because  of  the  reduction  in  hardware  reserves.  For  example, 
a  100  element  array  with  100%  redundancy  using  virtual 
channels  and  ideal  switches  would  have  an  MTBF  of  0.7/X.  If 
a  minimum  throughput  from  50  processors  was  acceptable  from 
a  100  element  array,  the  very  same  MTBF  would  be  reached 
with  50%  of  the  hardware.  The  intent  of  this  report  is 
merely  to  propose  this  fault  tolerant  approach  as  a 
possibility  for  systolic  array  fault  tolerance.  A  detailed 
assessment  of  the  application  of  this  graceful  degradation 
technique  is  a  topic  for  future  work  with  a  concentration  in 
the  area  of  specific  applications. 


CONCLUSIONS 


Systolic  array  reliability  performance  can  be  enhanced 
through  the  use  of  fault  tolerant  reconfiguration 
techniques.  Figure  14  points  out  the  difference  between  the 
MTBF  of  a  systolic  array  and  a  fault  tolerant  systolic  array 
(Virtual  method) .  The  ideal  theoretical  throughput  is  also 
shown  along  with  Amdahl's  throughput.  The  line  on  the  graph 
that  corresponds  to  Amdahl's  throughput  shows  the  actual 
throughput  that  can  be  expected  from  the  system.  Spatial 
and  temporal  redundancy  techniques  were  studied  and  their 
performance  quantified  in  terms  of  mean  time  between  failure 
(MTBF)  . 

The  results  of  the  investigation  clearly  show  that 
virtual  array  fault  tolerance  offers  the  best  theoretical 
improvement  for  high  throughput  requirements.  In  fact,  a 
constant  MTBF,  independent  of  array  size,  can  be 
demonstrated.  The  theoretical  improvement  is  nearly  an 
order  of  magnitude  better  than  redundant  arrays,  companion 
processor,  SRE,  or  tree  based  approaches.  The  virtual 
channel  technique  yields  a  theoretical  improvement  of 
0 . 7 (N) .  However,  the  practical  performance  is  a  function  of 
the  ratio  of  the  switching  mechanism  failure  rate  to  the 
processing  element  failure  rate.  The  improvement  is 
dependent  on  hardware  design  implementations. 

The  fault  tolerance  techniques  discussed  provide 
comparable  performance  for  low  throughput  levels.  In  fact, 
each  technique  converges  to  an  improvement  of  1.5  as  the 
number  of  processing  elements  approaches  unity  (assuming 
ideal  switching  mechanisms) .  Therefore,  for  low  throughput 
levels,  the  complexity  in  implementing  each  fault  tolerance 
technique  should  be  the  major  concern  when  cost  is  a 
priority . 

In  addition,  the  performance  of  each  fault  tolerance 
technique  analyzed  is  highly  dependent  on  the  reliability 
performance  of  the  switching  mechanism  used  in  the  design. 
This  dependence  cannot  be  overlooked  in  any  assessment.  The 
switch  design  is  an  important  consideration.  Generally,  in 
terms  of  failure  rate,  the  switch  must  outperform  the 
processing  element  by  at  least  one  order  of  magnitude  and 
preferably  more  than  two  orders  of  magnitude. 

The  results  reported  in  this  report  assume  that 
operational  processing  elements  and  hardware  reserves 
operate  over  the  same  time  period.  Additional  improvement 
can  be  sought  with  stand-by  connections  for  the  hardware 
reserves.  Stand-by  connections  would  eliminate  any  overlap 
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in  the  useful  life  of  operational  processing  elements  and 
reserves  to  maximize  the  total  MTBF  (reference  18)  . 

In  addition,  the  improvement  levels  outlined  are  not 
sufficient  to  recover  the  loss  in  reliability  due  to  array 
size.  The  design  of  highly  reliable  components,  i.e., 
processing  elements,  should  accompany  the  application  of 
fault  tolerance  architectures. 

Graceful  degradation  can  provide  a  cost  effective 
approach  to  fault  tolerance.  A  cost  versus  system 
performance  comparison  was  addressed  for  the  minimum 
variance  distortionless  response  beamformer. 

The  results  described  in  this  report  provide  only  a 
stepping  stone  in  the  path  of  maturation  for  reliable 
systolic  array  architectures.  Without  question,  there  are 
several  unresolved  issues  which  require  further  examination, 

i.e., 

1.  suitability  of  available  architectures  to  specific 
system  applications, 

2.  theoretical  versus  practicable  performance, 

3.  reliability  attainable  through  standby  operation, 
and 

4  . 


architecture  performance  validation  via  modeling 
and  simulation. 


APPENDIX  A 


PROOFS 

REDUNDANT  ARRAYS 

Pr (array  "A"  or  array  ”3"  works)  =  Pr ("A"  works  and  "B"  works) 

+  Pr ("A"  works  and  "B"  doesn't  work) 

+  Pr("A"  doesn't  work  and  "B"  works) . 

From  report:  Pr("A"  works)  =  Pr("B"  works)  =  e 

Pr  (array  "A"  or  array  "B”  works)  =  e  -n^-t  e  -nXt  +  e  -nkt  (i-e  -nkt) 

+  ( 1  -  e  )  e 

=  e  -2nXt  +  2e  -nXt  (l  -  e  nkt) 

=  e  -nkt  (2  -  e  -nkt) 

oo  oo 

MTBF  =  \  R  (t )  dt  =  1  [2e  -nXt  -  e  -2nkt]dt 

0  0 

MTBF  =  2/(Xn)  -  l/(2Xn)  =  1.5  (l/(Xn))  =  1.5  M 

Improvement  =  1.5. 

COMPANION  PROCESSORS 

Pr (companion  array  works)  =  Pr(all  switches  work)  [Pr (one 

processing  element  or  its  companion  work) ]r 

?r (one  processing  element  or  its  companion  works)  =  Pr(both  work) 

+  2Pr(one  works)Pr(the  other  doesn't  work) 
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Pr(one  processing  element  or  its  companion  works)  = 


e  ~kte  ~Xt  +  2e  ~Xt  (1  -  e  ~Xt)  =  R2  +  2R(1  -  R) 

Pr(companion  array  works)  =  e  “n^st  [e  _2^1'  +  2e  (1-e  "^■t)]n 


Pr(companion  array  works)  =  e  +  [2  -  e 


Pr  (companion  array  works)  =  e  +  [2  -  e  ^'t]n 


Pr (companion  array  works)  =  e  n^ft[2  -  e  _^t]n 


MTBF  = 


J  e  "nXft[2  -  e  -Xt]n 


i:  (A  +  B)n  =  Z/n\A(n_1)B1 


MTBF 


=  f  e  -nXft  Z  /n \2  (n_l)  (-1)  1  (e 


H 

i  =  0  Vi/ 


MTBF  =  |  E  /n\2  <n_i)  (-1)  xe  _X-t(l+nf) 
0  i=0\ i) 


MTBF 


=  Z  (  /n \2  <n_i)  ( -1 )  i>  /  a(i  +  nf) 


T) 

i=0  Vi/ 


MTBF 


=  l/(nX)  Z  /n  \2  (n_1)  (-1 )  1n  /  (i  +  nf) 


n 

i=0\ i  / 


s<- 9 


i'l  .i  I1*  1*L|'  Ij’lj  f.|‘l  I  ».|  *.4  »,<  ',1  *-l  M  *  I  _»«4  ».  4>,  n.4<t  tl»  41,  jl,  4U  |l.  4»«  At.  4*.  4  4>» 
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Improvement  =  X  /n\2(n-i)  (-1)1  n  /  (i+nf)  . 


n 

i=0 \i / 


SEQUENTIAL  ROW  ELIMINATION  (SRE)  AND  ALTERNATE 
ROW-COLUMN  (ARCE)  ELIMINATION  TECHNIQUES 


Given  a  discrete-state  continuous-transt ition  system 


which  has  exponentially  distributed  faults,  it  can  be  modeled  by  a 


Markov  model,  i.e., 


SRE  :  in  state  #0  n  X  n  elements  function. 


in  state  #1  (n-1)  X  n  elements  function. 


in  state  #2  (n-2)  X  n  elements  function,  etc. 


For  each  state  another  row  has  been  removed,  so  the  failure 


.ransition  rate  can  be  written  as  Ai  =  (n-i)nX.. 


ARCE  :  in  state  #0  n  X  n  elements  function 


in  state  #1  (n-1)  X  n  elements  function 


in  state  #2  (n-1)  X  (n-1)  elements  function,  etc. 


For  each  state  a  row  or  a  column  has  been  removed,  so  the  failure 


.ransition  rate  can  be  written  as 


Ai  =  (n  -  f ( i/2 ) 1)  (n  -  L(i/2) J> 


The  bars  signify  the  round-up  and  round-down  functions. 


h 

i 

1 

$ 

$ 


WWWW  wwwm’iwww.w  V  KWHF.  X».  K7HCT 
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Pr (system  is  in  present  state  for  At) 

=  Pr (system  was  in  last  state) 

+  Pr(no  change  in  state  will  occur), 

but 

Pr (system  was  in  last  state)  =  Pr  (last  state) Pr (transition  from 

last  state) , 

Pr (no  change  in  state  will  occur)  =  Pr  (current  state) Pr  (no 

transition) . 

By  using  the  above  results,  a  formula  for  both  SRE  and  ARCE  can  be 
derived  (reference  14),  i.e.. 

Pro  (t+At )  =  Prod  -  AoAt) 

Pri(t+At)  =  ProAoAt  +  Pri(l  -  AiAt ) 

Pr2 (t+At )  =  PriAiAt  +  Pr2(l  -  A2At) . 

Taking  the  derivative  with  respect  to  Dt  and  taking  the  limit 
as  At  — »  0  we  have 

Pr0' (t)  =  -A0Pr0 ( t ) 

Pri'  (t)  =  A0Pr0 (t)  -  AiPri (t) 

Pr2'(t)  =  AiPri (t)  -  A2Pr2(t) 

The  generalized  formula  is  Prk'(t)  =  Ak_iPrk_i  ( t )  -  AkPrk(t). 

Taking  the  Laplace  transform,  i.e., 

sPrk(s)  =  -AkPrk(s)  +  A^-xPr^-i ( s ) 

out. , 

sPr^-i (s)  =  -Ak-iPrk-i (s)  +  Ak-2Prk_2 (s) 


Prk_i (s) 


( Ak-2? rk-2 ( s )  )/ (s  +  Ak_ i )  . 


■nutwu  mnunurvwr 
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Therefore  the  inverse  Laplace  of 


k 

E 


1 


i=0  k 

(s  +  Ai)  FI  (Aj  -  Ai) 
j=0 
j*i 


X  - 

i=0  k 

IT  (Aj  -  Ai) 
j=0 
j*i 

which  leads  to  the  final  result  of 


Prjc(t) 


The  MTBF  is 


oo  k 


k-1 

n  Aj 

k  j=0 

E  - 

i=0  k 

n  (Aj  -  Ai) 

j=0 

j*i 

k-1 

n  Aj 

j=0 


|  I  -  e  -Ait  dt 

0  i  =  0  k 

n  (Aj  -  Ai> 


j=0 

j*i 


A- 6 


VIRTUAL  ARRAYS 


Pr (minimum  size  array  works)  =  Pr(all  switches  works 
*  [Pr(all  elements  work)  +  Pr(all  elements  except  one  work) 
+  ...+  Pr(all  elements  except  k  spares  work)] 


Pr(all  elements  work)  = 
Pr(all  elements  except  one  work) 


e 


- (n+k) Xt 


- (n+k-1) Xt 


Pr(all  elements  except  k  spares  work) 


=(n+k^e  nXt(l 


Pr  (all  switches  work)  =  e  -(n+k)^st 

K 

Pr  (minimum  size  array  works)  =  e"(n+lc>^st  X  /n+k\  e”  (n+lc_i)  ^  ( i_, 

i=0'  if 


K 


Pr (minimum  size  array  works) 


i=0' i  / 


Pr (minimum  size  array  works) 


K 

I  /n  +  k\  e  -t(n+k)f-i]Xt(1_e  -Xt 


APPENDIX  B 


PROOFS 


THIS  PROGRAM  WILL  CALCULATE  THE  IMPROVEMENT  FACTOR  OF 
COMPANION  FAULT  TOLERANCE  METHOD  GIVEN  THE  SWITCH  RATIO 
AND  THE  STEP  SIZE.  THE  STEP  SIZE  IS  NEEDED  TO 
CALCULATE  THE  INTEGRATION  BY  USING  THE  RIEMANN  SUM. 


PROGRAM  COMP 
IMPLICIT  NONE 


★★★★★★★★★★★★★ 


DECLARE  ALL  VARIABLES  THAT  WILL  BE  USED  IN  THE  PROGRAM  * 


REAL *4  PARTA,  Z,  X,  Y,  E,  L,  A,  B,  C,  DELTA,  N,  SR,  F,  R 
REAL  * 1 6  MTTF,  IMP (100) 


GET  THE  INPUT  FROM  THE  TERMINAL 


READ*,  SR,  DELTA 
F  =  1.  +  (l./SR) 

Q  ***★***★★★*★**★*★★**★★★★★★★★★*★**★★★*★**★*★★★★*★★*★★*★*★★★**★★★* 

C  *  LOOP  TO  GET  DIFFERENT  VALUES  OF  N,  WHICH  IS  THE  NUMBER  OF  * 
C  *  PROCESSING  ELEMENTS.  * 

C  *  THE  FOLLOWING  ROUTINE  CALCULATES  THE  FORMULA:  * 

C  *  °°  * 

C  *  }  e  -nft  (2  -  e  _t) n 

C  *  0  * 

( 2  **************************************************************** 


DO  N  =  0  .  , 144 . , 4 . 

MTTF  =  0. 

DO  L  =  0,  5.,  DELTA 

A  =  EXP ( -N*F*L ) 

R  =  EXP ( -L) 

B  =  2  .  -  R 
C  =  N/2. 

MTTF  =  MTTF  +  (  (A  *  B**C)  *  B**C) 

END  DO 
IMP (N)  = 


MTTF  *  N  *  DELTA 


0  ******************************************************************** 

C  *  PRINT  THE  OUTPUT  IN  TABLE  FORM  SO  THAT  THEY  CAN  BE  GRAPHED  LATER  * 

p  ********************************************************************* 


10 


PRINT  10,  N,  IMP (N) 
FORMAT (5X,F4 .0, 5X,  FI 0.4) 


END  DO 

STOP 

END 


PROGRAM 

SRE 

IMPLICIT 

NONE 

REAL 

DT,  DELTA,  PC, 

SR, 

R,  H,  E,  G,  S, 

REAL 

O,  B,  A 

REAL* 16 

PB,  Al,  A2,  ANSW, 

ANSW2,  ANSWER 

INTEGER* 4 

K,  I,  J,  Z,  N, 

w. 

D,  T,  COUNT 

CHARACTER 

AGAIN 

C 

C 

C 

C 

C 


**************************************************************** 

*  THIS  PROGRAM  CALCULATES  THE  IMPROVEMENT  FACTOR  OF  USING 

*  SEQUENTIAL  ROW  ELIMINATION  FAULT  TOLERANCE  METHOD,  GIVEN  * 

*  THE  SWITCH  RATIO  AND  THE  STEP  SIZE.  THE  INTEGERATION  IS  * 

*  DONE  BY  USING  SIMPSONS  RULE. 


0  ***************************************************************: 
AGAIN  =  'Y' 

DO  WHILE  (AGAIN  .EQ.  'Y') 

AGAIN  ='J' 

ANSWER  =  0. 


q  **************************************************************** 
C  *  GET  THE  INPUT  FROM  THE  TERMINAL  AND  CALCULATE  THE  STEP  SIZE.* 
0  **************************************************************** 

READ*,  N,  D 
H  =  0 
E  =  3. 

G  =  60. 

DELTA  =  (E-H) /G 
COUNT  =  -1 


B-2 


1 

l 

\i 

( L 

L 

,X_ 

•j; 

■> 

V. 

A 

*>' 

v 


3 

s 

% 


**************************************************************** 

*  THE  FOLLOWING  ROUTINE  CALCULATES  THE  FORMULA  :  * 

*  K-l  * 

*  n  Xi  * 


»  D  K 


e  ■*'it  dt 


j  £  £  - 

0  k=0  i=0  k 

n  (Xj  —  Xi) 

j=0 


★  ★★★★★★★★★★★★★★★★★★★sir******************************************* 

DO  DT  =  0,  3.,  DELTA 

COUNT  =  COUNT  +  1 
R  =  (E  -  H)  /  <G  *  3.) 

SR  =  100000  . 

Z  =  N*  *2 

PC  =  EXP ( (1/SR) *Z*-DT) 

ANSW2  =  0. 

DO  K  =  0,  D 

ANSW  =  0. 

DO  I  =  0,  K 

A1  =  1. 

A2  =  1. 

DO  J  =  0,  K-l 

B  =  D  *  (N  -  J) 

A1  =  A1  *  B 
END  DO 

A  =  D  *  (N  -  I) 

PB  =  EXP ( -A  *  DT) 

DO  J  =  0,  K 

IF  (J  .EQ.  I)  GOTO  10 
B  =  D  *  (N  -  J) 

A2  =  A2  *  (B  -  A) 

)  END  DO 

ANSW  =  ANSW  +  ((A1/A2)  *  PB) 

END  DO 

ANSW2  =  ANSW2  +  (ANSW*PC) 

END  DO 

IF  (IAND (COUNT, 1)  . EQ .  0)  THEN 

COEF  =  2  . 

ELSE 

COEF  =  4 . 

END  IF 

IF  (DT  .EQ.  0  .OR.  D  . EQ .  3.)  THEN 
COEF  =  1 . 

END  IF 

ANSWER  =  ANSWER  +  (ANSW2*COEF) 

END  DO 


m 


C  *  PRINT  OUT  RESULTS  OF  PROGRAM  * 


S  =  D  **  2 

PRINT  20,  S,  ANSWER*R*S 
FORMAT  (2X,F4.0,  F20.4) 
READ  30,  AGAIN 
FORMAT  (A1 ) 

END  DO 

STOP 

END 

PROGRAM  ARCE 
IMPLICIT  NONE 


REAL 

DT,  DELTA,  C,  PC, 

SR,  R,  H,  E,  G,  S, 

REAL 

0,  B,  A,  F 

REAL* 16 

PB,  Al,  A2,  ANSW, 

ANSW2,  PA,  ANSWER 

INTEGER* 4 

K,  I,  J,  Z,  N,  W, 

D,  T,  COUNT 

CHARACTER 

AGAIN 

C  *  THIS  PROGRAM  CALCULATES  THE  IMPROVEMENT  FACTOR  OF  USING 
C  *  ALTERNATE  ROW-COLUMN  ELIMINATION  FAULT  TOLERANCE  METHOD, 

C  *  GIVEN  THE  SWITCH  RATIO  AND  THE  STEP  SIZE.  THE  INTEGERATION  * 
C  *  DONE  BY  USING  SIMPSONS  RULE.  * 

q  ★★*★***★***★*★★★★★★★★★★★★★★★★★★★****★★★★★★*★★★****★★*★*★****★•**★ 

AGAIN  =  'Y' 

DO  WHILE  (AGAIN  . EQ .  * Y ’ ) 

AGAIN  ='J' 

ANSWER  =  0. 


2  ★  ★*★***★★★**★★★*★★★★★★★★★★*★★★*★*★***★**★*★★★★*★★****★*★*★■*★★*•*•* 
C  *  GET  THE  INPUT  FROM  THE  TERMINAL  AND  CALCULATE  THE  STEP  SIZE.* 
c  **************************************************************** 

READ*,  N,  D 
H  =  0 
E  =  3. 

G  =  60. 

DELTA  =  (E-H) /G 
COUNT  =  -1 


THE  FOLLOWING  ROUTINE  CALCULATES  THE  FORMULA  : 

K-l 

n  Jtj 

oo  D  K  j=0 

/  Z  Z  -  e  "Xit  dt 

0  k=0  i=0  k 

n  (X,j  —  Xj_) 

j=0 

j*0 


DO  DT  =  0,  3.  ,  DELTA 

COUNT  =  COUNT  +  1 
R  =  (E  -  H)  /  (G  *  3.) 

SR  =  100000. 

Z  =  N*  *2 

PC  =  EXP  (  (1/SR) *Z*-DT) 
ANSW2  =  0. 

DO  K  =  0,  D 

ANSW  =  0. 

DO  I  =  0,  K 


A1  =  1 . 

A2  =  1. 

DO  J  =  0,  K-l 
M  =  N  -  ( J/2  .  ) 

F  =  NINT(M) 

O  =  INT (M) 

A1  =  A1  *  F  *  O 
END  DO 

C  =  N  -  (1/2.) 

A  =  NINT(C) 

B  =  INT(C) 

PA  =  A  *  B 

PB  =  EXP ( -PA  *  DT) 

DO  J  =  0,  K 

IF  (J  .EQ.  I)  GOTO  10 
P  =  N  -  (J/2 . ) 

Q  =  NINT(P) 

L  =  INT(P) 

A2  =  A2  *  (  (L  *  Q)  - 
END  DO 

ANSW  =  ANSW  +  ((A1/A2)  *  PB) 

END  DO 

ANSW2  =  ANSW2  +  (ANSW*PC) 

END  DO 

IF  ( IAND (COUNT, 1)  . EQ .  0)  THEN 


END  IF 

IF  (DT  .EQ.  0  .OR.  D  .EQ.  3.)  THEN 
COEF  =  1. 

END  IF 

ANSWER  =  ANSWER  +  (ANSW2*COEF) 

END  DO 

Q 

C  *  PRINT  OUT  RESULTS  OF  PROGRAM  * 

Q 

S  =  (N  -  (D/2 .  )  )  **  2 
PRINT  20,  S,  ANSWER*R*S 
20  FORMAT  (2X,F4.0,  F20.4) 

READ  30,  AGAIN 
30  FORMAT  (Al) 

END  DO 

STOP 

END 

Q 

C  *  THIS  PROGRAM  CALCULATES  THE  IMPROVEMENT  FACTORS  OF  USING 
C  *  VIRTUAL  FAULT  TOLERANCE  METHOD  GIVEN  THE  SWTICH  RATIO  AND  * 

C  *  REDUNDANCY.  THE  INTEGRATION  OF  THE  FORMULA  IS  DONE  BY  * 

C  *  USING  SIMPSONS  RULE.  * 

0  ****************************************************************** 

PROGRAM  VIR 
IMPLICIT  NONE 


0  ★  *  *  ★  *  *  ★  *  *  ★  *  ★  ★  ★  *  ★  ★  ★  ★  ★  *  *  ★  ★  *  ★  *  *  *  *  *  *  ★  *  *  *  ★  ★  ★  ★  *  *  ★  ★  *  ★  *  ★  ★  *  ★  *  *  *  ★  *  ★  *******'^^‘ 

C  *  DECLARE  THE  VARIABLES  THAT  WILL  BE  USED  IN  THE  PROGRAM 

C  ****************************************************************** 

REAL* 16  CMB,  Z,  PARTB2 

REAL* 16  ANSW,  ANSWER,  PARTA,  PARTB,  PARTC,  PARTD,  PA,  PB 
REAL  R,  W,  U,  P,  F,  K,  SR,  Y,  D,  DELTA 

REAL  DT,  J,  CONST,  RK,  E,  A,  B,  CON,  S IMPSONS_CON,  COEFFICIENT 
INTEGER  X,  N,  I,  RED,  T,  COUNT 
DIMENSION  CMB (400, 402) 

LOGICAL  AGAIN 
CHARACTER  CHAR 


**★*★*****★*★★★★*★*★★★★*★★*★★★★***★★*★★★***★★**★★****★★***★*******•* 

*  THE  FOLLOWING  ROUTINE  CALUCLATES  PASCALS  TRIANGLE.  THE 

*  RESULTS  ARE  STORED  IN  AN  ARRAY,  SO  THAT  THEY  CAN  BE  USED  * 

*  LATER  IN  THE  MAIN  PROGRAM.  THE  ROUTINE  TAKES  ADVANTAGE  * 

*  OF  THE  SYMMETRY  OF  FASCALS  TRIANGLE. 


C 

C 

c 

C 

r 


DO  N  =  1, 300 

CMB (N, 1)  =  1 . 
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o  n 


CMB (N, 2)  =  N 
X  =  I AND (N, 1 ) 

IF  (X  .EQ.  0)  THEN 

DO  I  =  3, (N/2) +1 
R  =  N 
W  =  I  -  1. 

IF  (I  .EQ.  ( (N/2) +1) )  THEN 

CMB  (N,  I )  =  CMB(N,N+1-I)  *  (  (R+l.-W) /W) 
ELSE 

CMB (N, I )  =  CMB(N, 1-1) *( (R+l.-W) /W) 
END  IF 

END  DO 

ELSE 

DO  I  =  3, (N+l) / 2 
R  =  N 
W  =  I  -  1. 

CMB (N, I )  =  CMB (N,  1-1) *( (R+l.-W) /W) 

END  DO 

END  IF 

IF  (X  .EQ.  0)  THEN 

DO  I  =  (N/2) +2,  N+l 

CMB (N,  I )  =  CMB (N,  N+2-I) 

END  DO 

ELSE 

DO  I  =  (  (  (N+l) 12)  +1) ,  N+l 

CMB (N, I )  =  CMB (N,  N+2-I) 

END  DO 

END  IF 


END  DO 


*  GET  RID  OF  THE  UNSIGNIFICANT  VALUES  RIGHT  OF  THE  DECIMAL  POINT  * 
******************************************************************* 

DO  R  =  1  . ,  300  . 

DO  W  =  1 . , R  +  1 . 

CMB (R, W)  =  QNINT (CMB (R, W) ) 

END  DO 

END  DO 


******************************************************************* 

*  START  OF  THE  MAIN  PART  * 

*  SOLVING  THE  EQUATION  : 

*  * 

*  oo  K 

„  f  v  „  -  ( F  (N+K)  -  I|t„  _  -t,I 


AGAIN 


IF  (AGAIN  .EQ.  .FALSE.)  GO  TO  100 
READ*,  SR,  RED 
DELTA  =  .5 
PRINT  60,  SR, RED 

FORMAT (/5X, 'SWITCH  RATIO: * ,F7.0,3X, ' REDUNDANCY ', 14 ) 
DO  R  =  96. ,  144 . ,  8  . 

F  =  1.  +  (l./SR) 


IF  (RED 

•  EQ. 

25)  K  =  R/4 

IF  (RED 

.EQ. 

50)  K  =  R/2 

IF  (RED 

.EQ. 

75)  K  =  R  * 

IF  (RED 

.EQ. 

100)  K  =  R 

ANSWER  = 

0. 

RK  =  R  + 

K 

CONST  = 

RK  * 

F 

SIMPSONS 

CON 

=  1 .  /  6  . 

COUNT  = 

-1 

DO  DT  = 

0,  5, 

. ,  DELTA 

ANSW 

=  0. 

COUNT  =  COUNT  +  1 
E  =  EXP  (-DT) 

PARTC  =  1.  -  E 
DO  J  =  0 . , K 

IF  (J  .EQ.  0  )  THEN 
PARTB  =  1. 
PA  =  PARTB 


ELSE 

PARTB  =  E**  (  (CONST-J) /2.0) 
PA  =  (PARTC**  (J/2. )) *PARTB 

END  IF 

ANSW  =  ANSW  +  (CMB (RK, J+l . ) *PA*PA) 

★★★★★★★★★★★★★★★★★★★★★A************************************* 


VALUES  WHERE  DIVIDED  BY  2,  SO  THAT  THE  ROUND  OFF 
ERRORS  WOULD  BE  MINIMAL. 

★  ★★★★★★★★★★★★★★★★★★★★***AA********A**************A*****A*:k* 


END  DO 

IF  ( I AND (COUNT, 1)  .EQ.  0)  THEN 
COEFFICIENT  =  2. 

ELSE 

COEFFICIENT  =  4. 

END  IF 

IF  (DT  .EQ.  0  .OR.  DT  .EQ.  5.)  THEN 
COEFFICIENT  =  1 . 

END  IF 

ANSWER  =  ANSWER  +  SIMPSONS_CON*COEFFICIENT*ANSW 
END  DO 


PRINT  OUT  THE  RESULT  AND  LOOP  IF  THE  USER  RESPONDS 
POSITIVELY. 


PRINT  65,  R, ANSWER* SIMPSONS_CON 
FORMAT ( 5X, ' N  = ' , F4 . 0 , 2X, F2 0 . 4 ) 


END  DO 


READ  70,  CHAR 
FORMAT  <A1) 

IF  ( ICHAR (CHAR)  .EQ.  78)  THEN 
AGAIN  =  .FALSE. 

END  IF 
GO  TO  1 
STOP 
END 


B-9/B-10 
Reverse  Blank 


vw  y." w.»  wjnx7j(  »w  »jr»  v“  if  if  vvi>'  iVlVvv.v.wwv  v  'i 
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